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Abstract: For estimating a positive normal mean, Zhang and Woodroofe 
(2003) as well as Roe and Woodroofe (2000) investigate 100(1 - a)% HPD 
credible sets associated with priors obtained as the truncation of noninfor- 
mative priors onto the restricted parameter space. Namely, they establish the 
attractive lower bound of for the frequentist coverage probability of these 

procedures. In this work, we establish that the lower bound of is appli- 

cable for a substantially more general setting with underlying distributional 
symmetry, and obtain various other properties. The derivations are unified 
and are driven by the choice of a right Haar invariant prior. Investigations 
of non-symmetric models are carried out and similar results are obtained. 
Namely, (i) we show that the lower bound still applies for certain types 

of asymmetry (or skewness), and (ii) we extend results obtained by Zhang and 
Woodroofe (2002) for estimating the scale parameter of a Fisher distribution; 
which arises in estimating the ratio of variance components in a one-way bal- 
anced random effects ANOVA. Finally, various examples illustrating the wide 
scope of applications are expanded upon. Examples include estimating param- 
eters in location models and location-scale models, estimating scale param- 
eters in scale models, estimating linear combinations of location parameters 
such as differences, estimating ratios of scale parameters, and problems with 
non-independent observations. 



1. Introduction 

For a lower bounded normal mean 9 (say 9 > a) with unknown standard devi- 
ation a, and for independent observables X and W with X ^ N(6', cr^), W ~ 
Gamma(|, 2(7^), Zhang and Woodroofe 9] investigate 100 x (1 — a)% highest pos- 
terior density (HPD) credible sets 1,^ {X, W) associated with the (improper) prior 
density tto{9, a) = ^l[a,oo)(6')l(o,oo)(o'). Using the posterior density 9\{X, W), which 
brings into play a truncated Student pdf, they begin by constructing 
lTro{X,W) as the 100 x (1 — a)% Bayesian interval where the posterior density 
is the largest. 

Then, attractive features of the frequentist coverage of the Bayesian confidence 
interval /ttq (-^i W^) £^re established. In particular, they show that 

1 - a 

(1) Pe,.(/.„(^, W) contains d) > -—, 

1 + a 

^Departement de mathomatiquos, Universite de Sherbrooke, Sherbrooke Qc, CANADA, 
JIK 2R1, e-mail: |er ic . mar chandOusherbr ooke . ca 

^Department of Statistics, Rutgers University, 501 Hill Center, Busch Campus, Piscataway, 
NJ 08855, USA, e-mail: strawOstat .rutgers . edu 

AMS 2000 subject classifications: 62F10, 62F15, 62F30, 62F99. 

Keywords and phrases: restricted parameter space, Bayesian methods, confidence intervals, 
frequentist coverage probability, unimodality. 



112 



On the behavior of Bayesian credible intervals 



113 



for all {0, a) such that 9 > a and cr > 0. 

For the case of a known standard deviation, similar developments were given 
previously by Roe and Woodroofe 6]. Analogously to ([T|), they obtain that 

1 - a 

(2) Peil^^iX) contains 9) > , 

I + a 

for all 9 > a; where /^^(X) is the HPD credible set associated with the prior 
"uniform" density TTir{9) — l[a.oo)(^)- Interestingly, for the estimation of the ratio 
of variance components in a one-way balanced model analysis of variance with 
random effects, Zhang and Woodroofe obtain results of the same nature. 

The objective here is to present extensions of ([1]) and ^ to other probability 
models, as well as generalizations to other restricted parameter space scenarios. A 
notable feature resides in the universal resonance, for symmetric models and for 
certain types of asymmetric models, of the lower bound As well, additional 

frequentist properties of the studied credible intervals are obtained. Although the 
methods of proof follow for the most part those in the above mentioned papers of 
Roe, Woodroofe and Zhang, it is particular interesting that the methods of proof 
are unified. Moreover, we actually offer a useful simplification. 

Inference problems for constrained parameter spaces has, for many years, held 
the interest of statisticians. Correspondingly, as reviewed by Marchand and Straw- 
derman Q or van Eeden it has been a fairly active field. Recently though, there 
has been a renewed interest from the particle physicist community with high energy 
experiments leading to constrained parameter models (see for instance [2, 
and more specifically to the problem of setting confidence bounds in the presence 
of constrained parameters. Actually, a vigorous and substantial debate has arisen, 
focussing indeed on the choice of method, with an underlying Bayesian-frequentist 
comparison of the respective advantages and disadvantages (e.g., |4i]). 

As an example for the normal model above with known variance, it has been 
observed that the so-called "unified method" put forth by Feldman and Cousins ^ ; 
which is a frequentist based method arrived at by the inversion of a LRT and which 
leads to exact frequentist coverage; produces "quite short" intervals for small values 
of X, in comparison at least to Itt^^X). Such observations are not surprising since 
the methods differ in how they take into account the lower-bound constraint. As 
argued by Zhang and Woodroofe @ and Roe and Woodroofe , the HPD credible 
intervals /^g {X, W) and {X) are quite sensible ways to deal with the lower 
bound constraint. If such is the case, then good frequentist coverage properties of 
these Bayesian confidence intervals would render them more attractive, even from a 
frequentist point of view. There lies as well an intrinsic interest in these procedures 
given that the untruncated versions of the priors ttq and ttu lead to the usual, and 
introductory textbook, t and z two-sided 100 x (1 — a)% intervals; which of course 
have exact coverage. 

The paper is organized as follows. Symmetric models are treated in Section 2, 
while asymmetric models are reserved for Section 4. The main finding of Section 2 
relates to the choice of the truncation of the Haar right invariant prior for a large 
class of problems, with underlying unimodality and symmetry, which leads to the 
lower bound for the frequentist coverage probability of the associated HPD 
credible set. Various other corollaries are available. For instance, an exact coverage 
probability of for boundary parameter values is established, and the above 
lower bound is shown to hold for a Bayesian confidence interval which is 
not HPD (see Remark 1, part c). Various examples, illustrating the wide scope 
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of applications, are expanded upon in Section 3. The developments for asymmetric 
models is more delicate requiring a categorization of different types of asymmetry. 
In cases where the underlying models' density is not monotone, the lower bounds 
obtained in general are less explicit, but there is evidence that these lower bounds 
can be quite large. Moreover, the last result (Corollary 2) actually recovers the 
lower bound for certain types of underlying skewness, as a generalization of 
the symmetric case. 



2. Symmetric models 

We first consider models with an observable scalar or vector X having densities 
f{x;6); G A C for which there exists a lower bound constraint of the form 
t{6) > 0; t{6) : 3fJP — > 3fi. Moreover, we work with a structure, which is present 
in previous work described above, and where there exists a linear pivotal quan- 
tity of the form with underlying absolutely continuous, symmetric and 
(strictly) unimodal density. An immediate example consists of symmetric and uni- 
modal location densities f{x; 9) — fo{x — 9), with t{9) — 6 > and the pivot X — 9. 
Further examples are presented in Section 3. 

We study HPD credible intervals I-^g{X), based on {ai{X), a2{X)), associated 
with a prior ttq obtained as the truncation onto the parameter space {9 : t{9) > 0} 
of a Haar right invariant density 7t{9). To describe the construction of this interval 
as well as several others that follow, it is useful to define the following quantities. 

Definition 1. For a given continuous cumulative distribution function i^, a G 
(0, 1), y e 3?, we define: 

d,{y) = F'\l - aF{y));d2{y) = F-\^ + ^F{y)); and 
d{y) = ma.x{di{y),d2{y)). 

In situations where we wish to emphasize the dependence of the above functions 
on the pair (F, a), we will write instead dip ^ , d2p „ , and dp^a. To a large extent, the 
frequentist properties which we establish below depend on the following property 
of dF,a', which is easily established. 

Lemma 1. For all {F,a), we have d{y) > d{dQ) = do, with do = F~^{y\:^)- 
well, d{y) = di{y) if and only if y < d^. 

Proof. A direct evaluation tells us that dg = F~^{j^) is a fixed point of di, ^2, and 
hence of d. The result follows as di(y) decreases in y, and c?2(?/) increases in y. □ 

The following theorem is our first key result. Paired with Corollary[Tl it will lead 
to various applications which are generalizations of ([1]) and ([2]). 

Theorem 1. For a model X\9 ^ f(x;9), and a parametric function t{9) : 3?^ — > 
3? such that t{9) > (constraint); suppose there exists a linear pivot T[X,9) = 
'^^''a2(x)'^^ ; with a2(-) > 0; such that the distribution ofT{X, 9)\9 is given by cdf G, 
with pdf G' which is symmetric and unimodal (without loss of generality, about 0). 
Suppose further that there exists a prior 7t{9) supported on the natural parameter 
space such that: 



(3) 



T{X,9)\x ^'^ T{X,9)\9; 
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(i.e., the frequentist distribution ofT{X,9) for a given 9; which is independent of 
9 and given by cdf G; matches the posterior distribution of T(X, 9) for any given 
value X of X). Then, for the prior t:q{9) — t^{9)1[o,oo){t{9)) , we have: 

(a) I^„{X) = [l{X),u{X)], withl{X)^m(ix{0,aiiX)-dGA^)<^2iX)} and 
u{X)^a,{X) + dGA^h2{X); 

(b) Pe{I^„{X) 3 t{9)) > i^, for all 9 such that t{9) > 0; 

(c) Pg{I^„{X) 3 0) = j^, for all 9 such that t{9) = 0; 

(d) \im,(e)^^Pe{hM) 3 ^W) = 1 - «0 

Proof, (a) Denote h^, H^, and H^^ as the pdf, cdf, and inverse cdf of the 
posterior distribution of t(9) under ttq. Since T{X,9) is a pivot, implying that 
its distribution is, for any given 9, free of 9, we infer from ([3|) that, for 9 ^ ir, 
PAT{X,9) < y\x) = G{y) , or equivalently P^{t{9) > y\x) = G{ii^) . By 
definition of ttq, this gives us for y >0 

Hx{y) = Pno{T{9) < y\x) = 1 - = 1 - 



PArie) > 0\x) G(^) 

\ 09 la; 1 ' 



^ 02(2;) ' 

and 

H-\A) = a,{x) - a2{x)G-\{l - A)G(^)). 

a2{x) 

Now, observe that the posterior density is unimodal, with a maximum at 
max(0, ai(x)). From this, since our HPD credible interval may be represented as 
{t{9) : hx{T{9)) > c} for some constant c (e.g., jlj, page 140), we infer that either: 

(i) l{x) = and u{x) = H^i^ ~ a) = ai{x) - a2{x)G'^{aG{^^) ), or 

(ii) l{x) = ai{x) — b{x) and u{x) = ai{x) + b{x): for some b{x) such that ai{x) — 
b{x) > 0. 

From the symmetry of G", we have in (i): u{x) = ai{x) + a2{x)G^^ x 
(1 — a G{ ) ) = ai{x) + 02(2;) dig „ For (ii), we obtain also with the 

symmetry of G" that: 

P„„{ai{x) - b{x) < t{9) < ai{x) + b{x)\x) = l-a 
^ Hx{ai{x) + b{x)) - Hx{ai{x) - b{x)) = 1 - a 

^ G — -- =(l-aG— -- 

a2(x) a2[x) a2{x) 

^ G(-^) = i[l + (1 - a)G{^)] (by symmetry) 
a2[x) 2 a2(x) 

^ bix)=a2{x)G 1 - + ^— ^G — ^ =a2 X d2o.„ — K • 
2 2 a2(x) a2(X) 

Moreover, situation (ii) occurs iff 

ai{x) > a2(a;)d2 (_^)^G(— ^) > - + ^— G(^^) 
, . a2[x) a2[x) 2 2 02(2^) 

02(2;) 1 + a 



'^More precisely, we are referring of course to a sequence of 9^s;j = 1,2,... such the corre- 
sponding T((?i)'s have a limiting value of +00. 
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Finally, the result follows by combining (i) and (ii) and using Lemma [TJ 

(b) First, observe that the interval ai{X) ± rfG,a( a^lx) ) '^'^i-^) the same 
coverage probability as its subset /tt^ (X) for nonnegative values of t{9), since the 
difference of these two sets can only help in covering negative values of t{9). Now, 
along the lower bound do of Lemma [1] and the symmetry of G", we have for 0's such 
that T{e) > 0: 

Peil.aiX) 3 t{9)) = Pg{a^{X) - a2(X)dG,a(^4^) 

a2(,X) 

<T{9)<a,{X) + a2{X)dGA'^^)) 

a2\^ ) 

= PemXM<dGA^^.)) 
a2[X) 

>Pg{\T{X,0)\<do) 

= 2G(do) - 1 = 2G(G-i(-^)) - 1 = ^ . 

L + a 1 + a 

(c) Since coverage at t{9) — occurs if and only 1{X) — 0, we have by ^ for 
6l's such that t{9) = 0: 

Peil.oiX) 3 0) = PeiliX) = 0) = P,(^ii^l^ < do) = G(do) - ■ 

a2[X) l + a 

(d) Since is a pivot, implying that ^ oo" and G(^^) con- 
verges to 1 in probability as t{9) — > oo, it follows that dG,a( °^|^) ) (equal to 
G~^(| + ^-^Gi al\x] )^ large ) converges in probability, as t{9) oo, 
to G^^(l — j). In view of the above, and as in part (b), we have 

hm Pe(/.,(X)9r(0))= lim P,(|T(X, ^?)| < 

r(S)— >oo t(0)^oo (12(^ ) 

^Pomx,9)\<G-'{l-^)) 

= 2G(G-i(l-|))-l = l-a. □ 

Observe how critical ^ is, namely in the last line of the proof of part (b) where 
the identity G(G~^) arises. In fact, the "G~^" comes from the construction of 
lTrg{X) (hence the Ihs of ([31)), while the "G" comes from the frequentist coverage 
assessment of /7r(,(X) (hence the rhs of©). Condition Q may appear stringent but, 
as shown below, it is attainable for a large class of problems if the prior Tr{9) is Haar 
right invariant (informally, a prior leaving the measure of sets constant under certain 
transformations). For instance, consider a simple location model X ^ fo{x~9) with 
known /q. Set Z = X ~ 9 and consider the flat prior Tr{9) = 1. It is easy to verify 
that for any pair {x,9), the distributions Z\9 and Z\x match with density /o(-)j 
which tells us that condition ([3]) holds here with the choice of the flat (right Haar 
invariant also) prior. (It is important to note that the assumptions of symmetry 
and unimodality are additional and specific to Theorem [l] and are not required for 
the above illustration of This is exploited namely in Section 4 (also see Remark 
1, part c) where we make use of condition 

The various applications (see Section 3) which will follow from Theorem [1] are 
essentially all cases where the prior tt{9) is Haar right invariant (denoted tt^{9)) and 
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the pivot satisfies the invariance requirement T{x,9) = T(gx,g9), for all x £ X, 
9 £ Q, g & G, g £ G, with X, 9, G, and G being isomorphic ("equivalent"). We 
now pursue by showing how this invariance requirement and conditions lead to Q , 
hence permitting the application of Theorem [1] for a given problem. We make use 
of the following result (and notation) given in [1]. 

Lemma 2 (Result 3, p. 410 1]). Consider an invariant decision problem for which 
X , Q, G, and G are all isomorphic. Then, for an invariant decision rule S{x) — 
x{a), 

(5) i?-''(^l-){L(0,i(a))} = i?(0,5(X)), 

where 'k^(9\x) is the posterior distribution with respect to the right invariant (gen- 
eralized) prior density 7t^{9). 

Corollary 1. Suppose X, Q, G, and G are all isomorphic, and that T{X,9) is 
a function for which T{x,9) = T{gx,g9), for allx£X,9CzQ, gCzG,gGG. 
Then condition © holds, that is Pg[T{X,9) e A] = [r(X, 61) € A] for each 

measurable set A (where the Ihs gives the frequentist distribution of T{X,9) for 
given 9, and the rhs gives the posterior distribution ofT{X, 9) conditional on X for 
the right invariant Haar measure). 

Proof. It suffices to establish, for each measurable set A (in the range of T{X, 9)), 
the identity: 

(6) Pe{T{X, 9)eA)^ P"" ^'^^''\t {X , 9) e A). 

To do so, we apply Lemma [5] for loss Lp^{9,d) — lp^{T{d,9)), and for 5{X) = X. 
With G — G*, we indeed have that S{X) is an equivariant decision rule since 
5{gX) = gX = g*{X) = g*{6{X)). We also have by assumption on T: 

Lj^{g9,g*d) = Lj^{g9,gd) = lj^{T{gd, g9)) = lj^{T{d, 9)) = Lj^{9,d), 

which tells us that we have an invariant decision problem. Finally, applying LemmaH] 
yields ([6]) and establishes the Corollary. □ 

Now, prior to presenting various illustrations and applications of Theorem[T](and 
Corollary[T]) in Section 3, we conclude this section by expanding on some interesting 
aspects and implications of the results above. 

Remark 1. (a) Exact values or very good approximations of the frequentist cov- 
erage probability of /^g, which seem difficult to establish, are not provided 
explicitly by the results above. The exceptions are at the boundary where the 
probability of coverage exceeds the nominal coverage probability 1 — a, 
and when t(9) — > oo where the coverage probability tends to I — a. Numerical 
evaluations are provided, for the normal models described in the introduc- 
tion, by Roe and Woodroofe 0, and Zhang and Woodroofe P. Moreover, 
as pointed out in these manuscripts for a normal model G, and as suggested 
by the derivation above, the lower bound is, for a specific G, somewhat 
conservative. But it has the advantage of being simple and derived in a uni- 
fied fashion, applicable for a vast array of situations, and for quite general 
symmetric and unimodal densities G'. 
(b) In addition, the above development can be adapted to deal with the following 
robustness issue. Indeed, suppose that the actual model is governed by sym- 
metric pdf's fi{x;9), with corresponding cdf's Gi, in contrast to the bounds 
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which are set using G. Then, foUowing the proof of Theorem [T](b), above, we 
have 

PeihAX) 3 m) > Po{\T{X,0)\ < do) = 2Gi(do) - 1; 

which provides lower bounds or envelopes depending on Gi. Moreover, the 
quantity remains a lower bound on the probability of coverage for a given 
Gi as long as, simply, 

(7) Gi(do) > G{do); (with do = G-i(— |— )). 

1 + a 

Here, various properties of families of distributions can be elucidated to give 
realizations of d?]). For instance, d?]) holds as long as is nonincreasing 

in y;y > 0; or as a specific case if G[{y) = iG'(^), cr < 1, in other words /i 
and /o belong to the same scale family having increasing monotone likelihood 
ratio in \y\. 

(c) Interestingly, in the case of continuous but non-unimodal G', the above de- 
velopment remains valid with the difference that the interval /^p(X) is not 
HPD, in other words iTrg(X) is a credible interval with the same frequentist 
properties as those given in Theorem [U but it is not (necessarily) optimal in 
the sense of being the credible region with the shortest length. 

3. Examples 

We enumerate a list of situations for which Theorem [1] applies. The list is also illus- 
trative in the sense that we also specify components, such as the pivot T{X, 6) and 
the prior ttq of Theorem [TJ In all cases below with unimodal and symmetric den- 
sity G' , the lower bound applies for the coverage probability of the confidence 
interval I^g(X). In cases where the density G' is unimodal but not symmetric, the 
results of Section 4 will also apply to each one of the following situations as well. 

(a) (location) X ~ fo{x—9); fo unimodal and symmetric; t{9) — 9 > 0; T{X, 9) = 
X — 9; ■Ko{9) = l[o,oo)(^')- For example, this appfies for a N{9,a) model known 
a and 9 > 0; but also to many other common univariate symmetric models 
such as Logistic, Laplace, Cauchy and Student, etc. 

(b) (location-scale) (X.^X^) ^ /o(^,ff); r{9) = 9, > 0; T{X,9) = 

T^o{&) — ^l(o,oo)(^'2)l[o,oo)(^i)- Observe that T{X,9) is indeed a pivot here 
as it can expressed as the ratio of the elements of the pair ( ' "if ) ' 
whose distribution is free of (0i,6'2). An important case here arises with 
the model Yi, . . . ,Yn i.i.d. N{9i, (^2)^), and for which the sufficient statistic 

— g 

{Xi,X2) = {Y, admits a location-scale model as above with the distri- 
bution of T{X, 9)\9 being Student with n — 1 degrees of freedom. 

(c) (multivariate location) X = {Xi, . . . , Xp) ^ fo{xi ~ 9i, . . . ,Xp ~ 9p); t{9) = 
ELi«^^*; TiX,9) = (ELiO^^O - r{9);noi9) = 1[o,oo)(t(0)). For example, 
take X - Np{9,T,); E known; in which case T{X,9)\9 - N{0,a'T.a), with 
a' — (fli, . . . jflp). An important case here (and in (d) as well) concerns the 
estimation of the difference of two means 9i — 92, with the information that 
Ol > 92. 

(d) (multivariate location-scale with homogeneous scale) 

X = iXi,...,Xp,Xp+i) ^ /o(^,...,^,|^); r{9) = ELi«^«^; 
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nX.e) = ^^^-\'J \ MO) = 5^1(0.00) (ep+i)l[o,oo)(r(0)). For ex- 
ample, consider {Xi, . . . , Xp)' and Xp+i independent with (^i, . . . , Xp) ~ 
Np{{0i, . . . , 6*^), and Xp_^_-^ ~ Gamnia(r/2, 20p_,_j), in which case the 

distribution of T{X,9)\9 is distributed as {(ELi "^it^i dis- 

tributed Student with r degrees of freedom. 

(e) (scale with support being a subset of 3?+ or 3?+) X ^ |/i(^); t{9) = log{9) — 
log(a) > 0; TiX,9) = log(f ) (i.e., ai(X) - log(X) - log(a),a2(X) = 1); 
7ro(^) = ^^[o,od){t{S))- The constraint on t((?) corresponds to a lower bound 
constraint on 9, and confidence intervals for t{9) provide confidence inter- 
vals for 9, with corresponding frequentist coverage probabilities. As a specific 
example, consider a lognormal model with scale parameter 9; 9 > a{> 0); 
where ^ ~ e''^, Z ~ N(0, 1), and 6 being a known and positive shape pa- 

(log y)^ 

rameter. Here /i(j/) — (v^ttJ?/)^ e ~2l5 l(o,oo)(!/)i ^^'i tti^ distribution of 
T(X, is normal with mean and standard deviation 6. Additional ex- 
amples arise from scale models such that ^ and (^)~^ are equidistributed 
which implies symmetry for the distribution of the pivot T{X,9) = log(^). 
Further specific examples where ^ and {^)~'^ are equidistributed include the 
half-Cauchy with fi{y) — f (1 + y^)^^l(o,oo)(y)j and Fisher distributions with 
matching degrees of freedom in both numerator and denominator. On the 
other hand, if X ~ Gamma(a, 6*) for instance, then the distribution of log(^) 
is not symmetric (for any a), but the results of Section 4 apply nevertheless 
(see Example [2]). 

(f) (multivariate scale) {X,,...,Xp) ^ (Jitij-) ' ' ' ' I;); ^(^) = 
^i=i log(0i). For instance in correspondence to the problem of estimating 
the ratio of two scale parameters under the lower bound constraint ^ > a, 

Ti9) log(02) - log(0i) - log(a); TiX,9) - log(^) - log(|^) - log(a) (i.e, 
ai{X) = log(^) - log(a),02(X) = 1); 7r{9) = g^l[a^oo)i^)- Specific exam- 
ples here arise whenever Xi and X2 are independent with the distributions of 
log(^); 1 = 1,2; being symmetric (see part (e) above). Hence, Theorem[T]can 
be applied for instance to estimating a lower-bounded ratio of two lognormal 
scale parameters. 



We note that none of the above situations requires independence between the 
vector components (and see Example (g)). Theorem [1] applied to Example (a)and 
(b) extends the results of Roe and Woodroofe Zhang and Woodroofe 
obtained for the normal case. The asymmetric case studied by Zhang and Woodroofe 
(2002) which deals with a Fisher distribution is contained in part (e) (here Theorem 
[2] and perhaps Corollary [2] apply). Numerical displays of /^^(Ar) and of its coverage 
probability, in comparison namely to other confidence interval procedures, are given 
in the above papers, as well in [J]. 

The developments above are neither limited to samples of size 1 of AT, nor to cases 
where X is a sufficient statistic. Further applications are available by conditioning 
on the maximal invariant V. For instance, the results are applicable for location 
parameter families with densities f{xi ^ 9, . . . , Xn ^9), provided the conditional 
distribution of X„ — X]"=i -^i given the maximal invariant v = (xi — a;„, . . . , Xn-i — 
Xn) satisfies the conditions required for G (a.e. v). We conclude this section with 
an illustration with spherically symmetric models, and specifically to a multivariate 
student model. 
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(g) (sample of size n with underlying spherically symmetric distribution) Suppose 
the distribution oiX — {Xi, . . . , X„) is spherically symmetric about {9, ... ,9) 
with density f{x;9) — h{J2"^iixi — 6*)^), or equivalently, 

n 

(8) fix; 9) = h{n{xn - 9f + Y,{x, - x^f). 

1=1 

Considering now the pivot Z — T{X, 9) = Xn — 9 and the maximal invari- 
ant V = {Xi — Xn, ■ • • , Xn-i — Xn), Theorem [1] applies for the procedure 
Itto{^, V) which is constructed as in part (a) of Theorem [1] but with the cdfs 
Gy associated with the conditional distributions Z\V = v, or equivalently by 
virtue of ^ with the conditional pdfs 



(9) G'zi^iz) ^ h{nz^ + B{v)); 

with B{v) = v'iln^i + n')v, 1' = (1, . . . , 1); as B{v) = 'Etli^^ " The 
key points being that the conditional distributions Z\V = v are free of 9, and 
that the bounds on conditional coverage associated with G„ are free of i;. As a 
specific example, consider a multivariate Student model for X = (Xi, . . . , Xn) 
with d degrees of freedom, location parameter {9, ... ,9), scale parameter a, 
such that h{y) oc (1 + ^)^'^^' in (©• An evaluation of ^ tells us that 
W « (1 + lii^r^^^ « (1 + 1^)-^"^^ with = d + n - 1 and 
'-^'^ ^ 7t(d+?f-ij - -'-^ other words, the conditional cdfs Gz\v, which are used 
to construct /^p {X, v) , are those of a univariate Student distribution with 
degrees of freedom d + n — 1 and scale parameter cr' = \/ n{d+n-^ ■ 



4. Asymmetric models 

Here, we investigate and extend the results of Section 2 to unimodal, but not 
necessarily symmetric densities. However, as illustrated with the next example, 
unified lower bounds on the frequentist coverage probability, such as those given in 
Theorem [1] are not possible and conditions on the type of asymmetry are required. 

Example 1. Consider an exponential location model with density e^'^^^^H(o,oo)(2;— 
9); and 9 > 0. For the uniform prior Tra{9) = I[o,^)(9), the (1 - a) x 100% HPD 
credible interval is given by /^^^(A) = [l{X),u{X)], with l{x) — log(l — a + ae^) 
and u{x) = x. Observe that the interval never covers the value ^ = 0, so that the 
coverage probability Po(-^7ro (-^) 9 0) is equal to 0. Hence, a very different situation 
arises in comparison to the case of symmetric G's. Moreover, it is easy to establish 
that 

Peil.oiX) 3 9) = Pe{9 > log(l - a + ae^)) 

= Po{X <\og{l+'—^)) 
a 

= 1 - e-(i°s(i+4^)-e) 

e« - 1 
= (1-^) ^ , 1 ■ 
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Hence, the coverage probability can be quite small and never exceeds the nominal 
coverage level 1 — a. □ Finally, as one may anticipate, the same characteristics will 
arise for more general models with a property of monotone decreasing densities (see 
Theorem [21 part b). 

As in Theorem[l]and Corollary[l] the results below apply to models X\9 ^ f(x;9) 
and for estimating t(6') under the constraint t(6') > 0. 

Assumption 1. We assume again the existence of a linear pivot T{X,6) = 
(x^^^^ '^uch that -T{X,e) has cdf G, with (strict) unimodal G". Moreover, 
we assume without loss of generality that the density G' has a mode at 0. 

The confidence interval procedures studied are HPD credible based on (ai(X), 
02 (A")), and associated with the truncation ttq of the Haar right-invariant tt'" onto 
the constrained parameter space; i.e, ttq{9) = n^{9)I[Q^^-^{T{9)). We pursue with 
the introduction of various quantities and related properties which will help in 
describing the (1 — a) x 100% HPD credible interval /ttq (A), as well as some of its 
frequentist properties. In particular, as illustrated above in the contrasting results 
of Example [T] and of Theorem [Jl and since the frequentist properties which we 
can hope to establish depend on the type of asymmetry present, we breakdown, in 
Definition [4] and Corollary [21 these asymmetries into different relevant types. This 
is achieved in part with the introduction of the function C/c.a in Definition[3]below; 
which will also relate to familiar qualitative features such as skewness to the right 
(see Corollary [2]) . 

Definition 2. For cdf G with unimodal at density G", and A € (0,1), define 
7i(A) and 72(A) as values that minimize the length I71 + 72 1 among all intervals 
[-71,72] such that 0(72) - G(-7i) = A. 

Observe that the above defined 71(A) and 72(A) are indeed uniquely determined, 
and nonnegative given the unimodality. Furthermore, note that if G(0) G (0, 1), then 
we also have G'(-7i(A)) = G'(72(A)). 

Definition 3. Let 1 — a E (0,1) and G be a cdf with unimodal density G' with a 
mode at 0. Let 

(10) UgAv) - -y + 7i((l - a)(l - G{~y))); 

be defined for values y such that —y belongs to the support of G', (i.e., y S 
(_G-i(l),-G-i(0))). 

Definition 4. Let 1 — a e (0,1). Let Ci, C2, and C3 be classes of cdfs G with 
unimodal at densities G' such that 

Ci = {G : there exists an interior point yo G (— G^^(l), — G^^(O)) 

such that UG,a{yo) — 0} 
C2 = {G : Ucaiy) > for all -y on the support of G'} 
C3 — {G : Ucaiy) < for all -y on the support of G'} 

Lemma 3. In the context of Definition^ the classes Ci, C2, and C3 can alter- 
natively be described as Ci = {G : G(0) G (0,1)}, C2 = {G ; G(0) = 0}, and 
C3 = {G:G(0) = 1}. 



^On the other hand, the coverage rises fast as 8 increases and attains, for instance, Theorem 
[lis lower bound ^7" as soon as 8 = log 2. 



122 



E. Marchand and W. E. Strawderman 



Note. In other words, the class C2 consists of decreasmg densities G"; the class C3 
consists of increasing densities G' , and Ci consists of densities G' which increase on 
3?^ and decrease on 3?+. 

Proof. First observe that 

UGAy)\y=~G-H0) - G-i(O) + (71(1 - a)) < 0, 
with equality iff G-^{0) = 71(1 - a) = 0, i.e., G(0) = 0. Similarly, 
UGAy)\v=-G-Hi) - G-\l) + 7i(0) - G-i(l) > 0, 
with equality iff G(0) = 1. From these properties, we infer that 

(i) G(0) e (0,1) ^ G e Ci; 

(ii) G G C2 ^ G(0) = 0; 

(iii) G e C3 ^ G(0) = 1. 

Furthermore, if G(0) = 0, then 71 = and for such G's: UgAv) — ~y — ^^r all 
values y < — G~^(0) = 0, implying that 

(iv) G(0) = ^ G e C2. 

Also, if G(0) = 1, then -71(1 - a) = G-i(a) and G(-7i((l - a)(l - G(-y))) = 
a(l - G(-y)) + G(-y) > G(-?/); telling us that UgAv) < for aU y > -G-i(l), 
and implying that 

(v) G(0) = 1 ^ G e C3. 

Finally the converse of (i) follows from (iv) and (v). 

Although j/o depends on (a, G), we will not stress this dependence unless neces- 
sary. Here are some useful facts concerning Definition [Us yo . □ 

Lemma 4. (a) For G E Ci, we have UgAv) < iff V > Vo; 
(b) Furthemore, we have 

(11) 72((1 - a)(l - G(-j/o)) = G'\{\ ~a)+ aG{-y^)). 

Proof, (a) We prove the result for y > yo only, with a proof for y < yo following 
along the same lines. We want to show that UgAv) < ^ y > yo, i-e., 

(12) _y<_^,((i_a)(i_G(-y))). 
Define 

A = G(-yo) - G(-2/) 

Si = G{~yo) - G(-7i((l - «)(1 - Gi-y)))) and 

= G(72((l - a)(l - G(-y)))) - G(72((l -«)(!- G(-yo)))). 

Observe that ^ > since — y < — yo- Since the quantities 72(-z), 1 — G{—z), and 
G(z) are all increasing in z, it follows as well that B2 > 0. Now, with the definition 
of 71 and 72, and the identity UgAvo) = 0, we have 

Si + S2 = (1 - a)(l - G{-y)) + G(-yo) - G(72((l - a)(l - G(-yo)))) 
= (l-a)(l-G(-y)) 

+ G(-7i((l - «)(! - G(-yo)))) - G(72((l - a){l ~ G(-yo)))) 
= (1 - a)(l - G{-y)) - (1 - a)(l - G(-yo)) 
= (1 - a)(G(-yo) - G{-y)) ^{l~a)A< A (since A > 0). 
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Finally, the inequality Bi < A \s equivalent to p2)l and establishes part (a) for 
y > yo- 

(b) Using the identity ?7g,q(?/o) = and Definition [21 we have directly 

^yo = -7i((l-a)(l-G(-yo))) 

^ G(-yo)=G(-7i((l-a)(l-G(-2/o)))) 

^ G{-yo) = G(72((l - a)(l - G(-yo)))) - (1 - a)(l - G(-yo)) 
^ 1 - a + aG{-yo) = GM{1 -")(!- G(-yo)))); 

which is indeed equivalent to (fTTjl . □ 

Lemma 5. Under Assumptions^ 

(a) f/ie (1 — a) X 100% iJPD credible interval Ij^g{X) is of the form [1{X), u{X)] 
with either: 

(13) (i) l{x)^0,u{x)^ai{x) + a2{x)G-\l-a + aG{-^^)); 

a2[x) 

or 

(ii) l{x) = a,{x) - a,{x)j,{{l - a)(l - G(-^))), 

(14) «2(a;) 

= ai(a;) + a^ixh^Hl ~ a){l - G(-^))); 

(b) Furthermore, (i) occurs iff G ^ C2 or G £ Ci with < yg/ (^a^Jcf equivalently 
(ii) occurs iif G e C3, or G G Ci wit/i > yo)- 

Proof. Part (b) follows from ([M]) . the definition of the classes Ci;i = 1,2,3; and 
Lemma m To establish part (a), proceed as in the proof of Theorem [T] by denoting 
Hx, and as the cdf, and inverse cdf of the posterior distribution of t{9) under 
ttq. Since --T{X,9) is a pivot with cdf G, implying that its distribution for any 
given 9 is free of 9, we infer from ^ that, for 6* tt, P^(T{X, 9) > y\x) = G{~y) , 
or equivalently Pt^{t{9) < y\x) = G(^^^|^y^) . By definition of ttq, this gives us for 
2/ > 0, 

P^ r 6' >0x) i_G 

and 

H-\A) = ai(x) + a2(2;)G-i(A + (1 - A)G(-^)). 

Now, observe that the posterior density t{9)\x (cx G'(^^^^^^y^)/[o,oo) (?/)) is uni- 
modal, with a maximum at max(0, ai(a;)). Hence, we must have either: (i) l(x) = 
0, u{x) = H-^(l - a) yielding or (ii) 

Hx{u{x)) — Hx{l{x)) = 1 — a, with u{x) — l{x) minimal 

^, u{x) -aijx) l{x) - aijx) ai{x) 

a2[x) a2[x) a2[x) 

with u{x) — l(x) minimal, 

yielding (fT4|l by definition 71 and 72 (see Definition [2]) . □ 
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Theorem 2. Under Assumptionl^ we have 

(a) For G e C2, Pe{U„{X) 3 t{9)) > 1 ~ a for all 9 such that t{9) > 0; and 
Pg{I^„{X) 9 0) = 1 for all 9 such that t{9) = 0; 

(b) For G e C3,P0{Uo{X) 3 t{9)) < 1 - a for all 9 such that t{9) > 0; and 
Pg{I^„{X) 3 0) = for all 9 such that t{9) ^ 0; 

(c) For G e Ci,Pg{I^„{X) 3 t{9)) > (1 - a){l - G{~yo)) for all 9 such that 
t{9) > 0; and (X) 9 0) = 1 - G(-yo) for all 9 such that t{9) = (with 
Ho given in Definition^; 

(d) For unimodal G' , we have lim^(g-)^oc Pb{Ittq{X) 3 t{9)) = 1 — a. 

Proof, (a) If G G C2, lT^g{X) is given by with probability one. This imphes 
that Pg{lT,„{X) 9 0) = 1 for aU 9. As weh, u{x) > ai(x)+a2(a;)G"^(l-a) implying 
that 

PeilrroiX) 3 t{9)) = Pe{T{9) < u{X)) 

> Pe{r{9) < aiiX) + a2{X)G-\l - a)) 

= PeC-^^^^^^ < G-\l ~ a)) = G(G-i(l ~ a)) ^ 1 - a. 
a2[X) 

(b) If G e C3, iTToiX) is given by (fT4|) with probability one. This implies that 
P6i(/7ro(-'^) 9 0) = for aU 9 (in particular for those 9 such that t{9) = 0). As well, 
since l{x) = ai(a;)-a2(a;)7i((l-a)(l-G(-^^))) > l{x) = ai{x)-a2{x)^i{l-a), 
and similarly uix) < ai{x) + a2(a;)72(l — a), we infer that 

PeihoiX) 3 t{9)) < Pe{ai{X) - a2{x)-f^{l - a) 

< t{9) < ai(X) + 02(^)72(1 - a)) 

= ^e(-7(l - a) < < 72(1 - a)) = 1 - a. 

a2[-^ ) 

(c) First, given that coverage at t{9) = occurs if and only if 1{X) = 0, we have 
for 9 such that t{9) = 

Pg{I.,{X) 3 0) = Pe{l{X) = 0) = Pe C^^^)~^ < Vo) = 1 - G{-yo). 

a2{X) 

For the more general lower bound, the idea here is the same as the one in Theorem 
[U namely to work with a subset (with probability one) I'{X) of /7rg(X) for which 
the coverage of I'{X) is equal to (1 — a)(l — G(— yo))- To achieve this, we first 
establish that 

(15) u{x) > ai{x) + a2(a;)72((l - «)(1 - G(-j/o)))- 

Indeed, if ^^jfy < Vo, then u{x) > ai{x) + a2(a;)G^^((l — a + aG{—yo)) = ai{x) + 

a2(a:)72((l — a)(l — G(— j/o))), using (fTTjl . On the other hand, if 2l[x] — 2/0, then 
([T5|l follows directly as both 72(^) and 1 — G(— z)) increase with z. Similarly, if 
> 2/0, li^) is bounded above by ai{x) - a2(a;)7i((l - a)(l - G(-yo)))- The 
above bounds on l{x) and u{x) imply that the coverage probability of /jro (A") is 
bounded below by the coverage probability of [max(0, ai{x) — a2(x)7i((l — ct){l — 
G{—yo))),ai{x) + a2(2:)72((l — a)(l — G(— yo)))]; or equivalently by the coverage 
probability of 

/'(A) = [ai(a;)-a2(x)7i((l-a)(l-G(-yo))),ai(a;)+a2(x)72((l-a)(l-G(-yo)))]. 
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But finally, using the definition of 71 and 72, assumption ([3]), and the fact that 
-T(X,e)\e) has cdf G, we have Pe{h„{X) 9 t(6')) > Pe{I'{X) 9 t(6')), with 
Pe{I'{X) 3 Tie)) = Pe{ai{X) - a2(^)7i((l - - G(-yo))) < < ai{X) + 
a2(X)72((l - a)(l - G(-yo))) = P.(-7i((l - a){l G{-yo))) < ^^^^(^ < 
72((1 - a)(l - G(-yo))) = (1 - a)(l - G(-yo). 

(d) The result may be established along the lines of part (d) of Theorem[TJ □ 

The next result is a specialization of Theorem [5] to cases where the density G' is 
skewed to the right, or in other words the density of the pivot "^''(^(x")^^'' is skewed 
to the left. Namely, the following result demonstrates that the lower bounds on the 
frequentist coverage probability for symmetric densities G' also apply necessarily 
to right-skewed densities G'. 

Corollary 2. Under the conditions of Theorem\^ suppose further that 
(16) G(-7i(l-z)) < |; for all ze {0,1]; 

(or equivalently G{^"fi{l-' z)) < 1 — G{"f2{l^ z)) by definition of ji and^2)- Then, 
under the assumptions of Theorem\^ we have 

(a) Pe{h„{X) 3 T{e)) > for all 9 such that r(6') > 0; 

(b) Pb{I^„{X) 9 0) > for all 9 such that t{9) = 0. 

Proof. If G G C2, the lower bounds hold of course by virtue of Theorem^ If G € Ci, 
then we must have 

G(-yo) = G(-7i(l - a)(l - G{-yo))) < , 

with the inequality following from (fT6|) . The above now tells us that 1 — G(— j/o) > 
, and parts (a) and (b) follow from Theorem (2] There remains to show that 
G G C3 is incompatible with condition (fTB]) . But, if G € C3 (i.e., G(0) — 1), then 
(HH) cannot hold for z = 1 as G(-7i(0)) = G(0) = 1 > 5. □ 

Remark 2. Corollary [2] includes the particular case of symmetry with equality in 
(fT6|) and, therefore, can be viewed as a generalization of the results of Theorem 
[TJ The lower bounds on coverage probability given in Theorem [2] and Corollary 
[2] correspond to the ones given by Zhang and Woodroofe Q for a lower bounded 
scale parameter of a Fisher distribution, and arising in the estimation of the ratio of 
variance components in a one-way balanced model analysis of variance with random 
effects. 

Example 2 (Lower bounded Gamma scale parameter). As a followup to Example 
(e) of Section 3, consider a Gamma(r, 6*), 9 > a > 0, where X\9 ~ ^/i(f) with 

fi{y) = ^ r(r) " ^(0,00) (y)- Considering the cdf G of -(log(|^) - m), where to is 
chosen in such a way that G' has a mode at 0, which is required in Lemma [5] and 
Theorem [2l we obtain that — (log(^) — logr) has pdf 

G'{y)^^e-^^y+^-'\ 

r(r) 

and cdf G{y) = P(Gamma(r, 1) > re~^). For instance, with the exponential case 
(r = 1), we have G{y) = e~'^ " . Definition [2Is 71(A) and 72(A) satisfy the equation 
-71(A) +e'^i('^) = 72(A) +e^''2(^) with G(72(A)) - G(-7i(A)) = A, but are not 
available explicitly. Hence, as will be the case in general, neither the lower and upper 
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bounds 1{X) and u{X) of Lemma [SJ nor Definition |4ls yo, are available explicitly. 
However, Theorem 2 (parts c and d) do apply. For instance, witfi r = 3, a = 
0.05, a numerical evaluation yields yn ~ 0.9f2968 and 1 — G{~yo) ~ 0,979353, 
which gives the exact coverage at the boundary 9 = a, and which tells us that 
Theorem[5Js lower bound on coverage (f — a){l — G{—yo)) is approximatively equal 
to (0.95) (0.979353) = 0.930386. We were unable to establish but beheve that (dH) 
holds for the cdf 's of this example, which would permit the application of Corollary 
[21 but observe that the lower bound of 0.930386 actually exceeds Corollary[2|s lower 
bound of 2:|t = 0.904762. 

1.05 

We conclude by pointing out that the results of this paper do leave open several 
questions concerning further coverage probability properties of the Bayesian confi- 
dence interval /^^(X). Namely, as seen in the above example, it would be desirable 
for the quantity 1 — G{—yo) of Theorem [2] to be made more explicit. Further nu- 
merical evaluations of 1 — G(— j/o)i which also suggest quite high lower bounds on 
coverage, are given by Zhang and Woodroofe [8] in their particular case of a lower 
bounded Fisher distribution scale parameter. 
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